Who am I speaking at? Perceiving the head orientation of speakers from acoustic cues alone
Authors
Abstract
The ability of people, and of machines, to determine the position of a sound source in a room is well studied. The related ability to determine the orientation of a directed sound source, on the other hand, is not, but the few studies that exist show people to be surprisingly skilled at it. This has a bearing on studies of face-to-face interaction and of embodied spoken dialogue systems, because the sound source orientation of a speaker is connected to the speaker's head pose, which is meaningful in a number of ways. We describe in passing some preliminary findings that led us onto this line of investigation, and in detail a study in which we extend an experiment design intended to measure perception of gaze direction to test instead for perception of sound source orientation. The results corroborate those of previous studies, and further show that people are very good at performing this skill outside of studio conditions as well.
Similar resources
Role of pitch in perceiving politeness in Korean
It has been found that Korean speakers lower their average voice pitch when speaking politely [16, 17], contradicting the idea that high pitch is polite across all cultures, as proposed by Ohala’s Frequency Code hypothesis [e.g., 12]. This study looks at pitch as a perceptual cue to politeness in Korean. Ten Korean listeners heard short utterances from eight different speakers and judged whethe...
Speaking Clearly for the Blind: Acoustic and Articulatory Correlates of Speaking Conditions in Sighted and Congenitally Blind Speakers
Compared to conversational speech, clear speech is produced with longer vowel duration, greater intensity, increased contrasts between vowel categories, and decreased dispersion within vowel categories. Those acoustic correlates are produced by larger movements of the orofacial articulators, including visible (lips) and invisible (tongue) articulators. Thus, clear speech provides the listener w...
L2 Perception of English Fricatives in Clear and Conversational Speech: the Role of Phonemic, Phonetic, and Acoustic Factors
This study investigated perception by non-native listeners of English fricatives produced in clear and conversational speaking styles. We measured babble thresholds for fricative voicing and place of articulation contrasts by Standard German and Swabian German and native American English speakers. Overall, Swabian German speakers performed worse than both native English and Standard German spea...
Cortical magnetic responses for native and non-native speech sounds: MMNm induced by English /r/ and /l/
Native Japanese-speaking adults have difficulty perceiving the English /r/ and /l/ distinction since these exact sounds are not included in the Japanese phonetic system. To investigate the neural correlate of this phenomenon we recorded cortical magnetic responses (Magnetoencephalography, MEG) from American English (AE) and Japanese (J) speakers using /ra/ and /la/ sounds. A passive oddball par...
Acoustic and articulatory correlates of speaking condition in blind and sighted speakers
Compared to conversational speech, clear speech is produced with longer vowel duration, greater intensity, increased contrasts between vowel categories, and decreased dispersion within vowel categories. Those acoustic correlates are produced by larger movements of the orofacial articulators, including visible (lips) and invisible (tongue) articulators. How are those cues produced by visually im...